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Abstract. Once supersymmetry is found at the LHC, the question arises what are the fundamental 
parameters of the Lagrangian. The answer to this question should thereby not be biased by as- 
sumptions on high-scale models. SFitter is a tool designed for this task. Taking LHC (and possibly 
ILC) data as input it scans the TeV-scale MSSM parameter space using its new weighted Markov 
chain technique. Using this scan it determines a list of best-fitting parameter points. Additionally 
a log-likelihood map is calculated, which can be reduced to lower-dimensional Frequentist's profile 
likelihoods or Bayesian probability maps. 

PACS. 02.50.Fz Stochastic analysis - 12.60.Jv Supersymmetric models 



1 Introduction 

The search for a Higgs boson, or to find an alternative 
to such a fundamental scalar, are the main goals of the 
LHC, which will start operation next year. However, 
the Higgs-boson mass is quadratically divergent with 
the cutoff scale of the theory, and the question about 
possible ultraviolet completions of the Standard Model 
arises. Such a completion should also provide an an- 
swer for the second unsolved question in high-energy 
physics, the existence of cold dark matter. 

One possible, and particularly attractive, exten- 
sion of the Standard Model is supersymmetry. Rs pa- 
rameter space has been constrained by various sources 
over the last years. Collider data from LEP and TeVa- 
tron have put stringent bounds on the masses of the 
superparticles both via direct and indirect searches, 
like the mass of the lightest Higgs boson, and indirect 
measurements like the anomalous magnetic moment 
of the muon have put further restrictions on the pa- 
rameter space. Also there is no one-to-one correspon- 
dence of observables and parameters. This is especially 
true once we take loop-corrections into account. At 
the same time as few assumptions as possible should 
be imposed. They should instead be inferred from the 
available data. 

The program SFitter [1] is designed to solve the 
task of mapping up to 20-dimensional highly complex 
parameter spaces onto a large set of observables of 
different quality, which can be highly correlated. In 
this proceedings we discuss the techniques and options 
of SFitter. The experimentally well-studied MSUGRA 
parameter point SPSla j2] is thereby used as an ex- 
ample to illustrate the features and physical results of 
this parameter point. 



2 Tools and Techniques 

SFitter can use data input from both high-energy col- 
liders and low-energy constraints. Possible choices in- 
clude kinematic endpoints and thresholds which ap- 
pear in invariant-mass distributions, mass differences 
and masses themselves. Furthermore, it is possible to 
use branching ratios and cross sections. The latter ones 
are normally associated with large errors, so one would 
rely on them only when there are no other types of 
measurements available. But from a technical point of 
view large error bars pose no problem to SFitter. R is 
even possible to consider data from e.g. ATLAS and 
CMS measurements separately. 

This data is then compared to the theoretical pre- 
dictions. Starting from a parameter point the physi- 
cal spectrum is calculated by a choice of three spec- 
trum generators, SoftSUSY [3], SuSPECT [5] or ISAS- 
USY [4. . Next-to-leading-order cross sections for LHC 
or a future ILC are calculated by the program Pros- 
pino2 6 and branching ratios can be included via links 
to Msmlib [7] and SUSY- HIT [8]. Also the dark matter 
content of the relic density is readily available by an 
interface to micrOMEGAs [9J. The communication of 
parameters and results between the different programs 
is performed by the SUSY-Les-Houches-Accord [TU] 
data format using the implementation of SLHAio [11] . 

Comparing the experimental data di and the the- 
oretical prediction di, where the index i runs over the 
different data, a likelihood value is assigned to every 
point in parameter space. Hereby we follow the RFit 
scheme of CKMFitter as described in Ref. [T^] and as- 
sume the theoretical errors cr' heo as box-shaped. This 
scheme interprets theoretical errors as a lack of knowl- 
edge on a parameter. As long as the deviation between 
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theory and experiment is within the theoretical error, 
this must not have any influence on the total likeli- 
hood. In combination with the experimental error <r^ xp 

2 

the total log- likelihood log C = — \ is given by 
[O for \d 1 -d l \< cr* ho ° 

r ' {(■' i-r y a* i* (1) 

The experimental error itself is a combination of three 
different sources. All three are considered to be gaus- 
sian and therefore are summed quadratically. The sta- 
tistical error is assumed to be uncorrelated between 
different measurements. The first systematic error orig- 
inates from the lepton energy scale. It is taken to be 
99% correlated between different observables. The sec- 
ond one has its source in the jet energy scale and is 
taken with a 99% correlation as well. 

To reconstruct Lagrangian parameters a scan of 
the parameter space is performed. This is not only im- 
portant to obtain the correct solution, or possibly a 
set of solutions. It is also necessary to extract error 
estimates on the parameters from the data. To accom- 
plish this goal a choice of three different methods is 
available, which can be freely combined. The first one 
is using a fixed grid. Secondly, Minuit [13] is included 
as a minimum finder, which employs a steepest-descent 
hill-climbing algorithm. The third option uses the tech- 
nique of Weighted Markov Chains, which are described 
in more detail in the remainder of this section. 

Markov chains are defined as a sequence of points 
which are the result of a stochastic process. The spe- 
cial, Markov, property which characterises them is that 
the conditional probability of each point only depends 
on its direct predecessor, but not on any other pre- 
vious, or future, points in the chain. In SFitter the 
Metropolis-Hastings [14] algorithm is used for choos- 
ing the next point, which works in the following way. 
In the first part of the algorithm a new point is sug- 
gested based on the current one using a probability 
distribution function (PDF). The PDF can be freely 
chosen as long as it satisfies the property that the prob- 
ability from being at point x and proposing x' is the 
same as being at x' and proposing x. It can for exam- 
ple be chosen flat, then the Markov-Chain algorithm 
has no dependence between points at all and reduces 
to a simple Monte-Carlo fitting. A good performance 
was found using a Breit-Wigner-, or Cauchy-, shaped 
function. This type of function has more pronounced 
tails than a gaussian distribution and provides a better 
balance to avoid random-walk behaviour. The second 
part of Metropolis-Hastings consists of the acceptance 
stage. It is decided whether the suggested point is ac- 
cepted or rejected based on a potential, which in our 
case is p-. So if the log-likelihood of the suggested 
point is larger than that of the old one, it is always 
accepted, else, it is accepted if the ratio of the two log- 
likelihoods is larger than a random number r chosen 
uniformly between and 1. In all other cases the old 



point is added to the Markov chain another time: 

{^sugg 
if log£(x sugg ) > log£(x old ) 
or log£(x SU gg)/ log£(a; id) > r, 
Xoid else. 

(2) 

As a rule of thumb, an acceptance rate of 25% is gen- 
erally considered optimal. The resulting Markov chain 
has the property that the density of points is propor- 
tional to the potential, i.e. to \ which can then be 
used to obtain likelihood maps by binning the points. 
To do this binning we have developed a new algo- 
rithm |15j based on previous work by Ferrenberg and 
Swendsen |16j . This one does not just take every point 
with a weight of one, which corresponds to the clas- 
sic way. As the function we want to plot is the same 
which has been used in the acceptance decision, this 
information can be reused. We therefore weight the 
points with the potential function taking care that no 
double-counting is done and points where the likeli- 
hood vanishes are properly taken into account. 

It was shown [17] that Markov-chain techniques 
can scan high-dimensional parameter spaces very ef- 
ficiently. They have been extensively applied to con- 
strain the MSUGRA parameter space from current ex- 
perimental precision data [18lll9j . 

3 Reconstructing SPSla 

With these possibilities SFitter provides the relevant 
frequentist or Bayesian results in three steps: First, 
a fully exclusive log-likelihood map of the complete 
parameter space is calculated. In a second step the 
best local likelihood maxima are obtained from the 
map and ranked according to their likelihood. Last, the 
map is projected onto lower-dimensional spaces down 
to one-dimensional distributions. This can be done in 
both a Frequentist way, which yields a profile likeli- 
hood, and in a Bayesian way. Here the unwanted di- 
mensions are marginalised away and a prior needs to 
be specified for this. 

In general, no specific model for supersymmetry 
breaking should be assumed a priori. Instead this should 
be inferred from the data. However, to demonstrate 
the features of SFitter we consider here the MSUGRA 
scenario using the parameter point SPSla. Besides the 
MSUGRA parameters mo, m l / 2 , tan/3, Aq and sgn/i 
also the top-quark mass nit is taken as a free param- 
eter because of the large uncertainty on its value. A 
smeared dataset is generated from the measurements 
which can be performed in this parameter point. Using 
30 individual chains with 20000 points each a Weighted 
Markov run is used to produce the likelihood map. For 
the best points an additional Minuit fit is performed 
to find the exact position of the minimum. With this 
procedure we find four distinct minima as shown in Ta- 
ble [1] The best point corresponds indeed to the true 
solution where the deviations turn out to be compat- 
ible with the error bars. The second solution is given 
by very similar values for the continuous parameters 
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0.09 
1.50 
73.2 
139.5 



mo JTii/2 tan/3 Aq /i 



nit 



102.0 
104.8 
108.1 
112.1 



254.0 
242.1 
266.4 
261.0 



11.5 
12.9 
14.6 
18.0 



-95.2 
-174.4 
742.4 
632.6 



+ 



+ 



172.4 
172.3 
173.7 
173.0 



2.17 2.64 2.45 49.6 0.97 

Table 1. SFitter output for the SPSla point in MSUGRA. 
List of the best-fitting parameter points with associated 
log- likelihood. The last line denotes the corresponding er- 
rors for the best-fitting point. All masses are in units of 
GeV. 
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Fig. 1. SFitter output for the SPSla point in MSUGRA. 
Two-dimensional likelihood maps of the mo-m^ plane. 
Upper: Bayesian marginalised plots. Lower: Profile likeli- 
hoods. All masses are given in GeV. 



and a flipped sign of /i. Solution three shows a distinct 
maximum, and the last one again differs from the pre- 
vious one only by sgn fj,. For these last two solutions 
the trilinear parameter takes a large positive value to- 
gether with a slight shift in the top-quark mass. If the 
latter was kept fixed this distinct maximum would be 
much less pronounced. In the last line of Table [1] the 
corresponding errors on the parameters are printed. 

Fig. Q] shows the corresponding plots of the likeli- 
hood map projected onto the mrj-mi/2 plane. The peak 
is clearly at the right position in both the marginalised 
plot and the profile likelihood. The resolution of the 
plots is too coarse so the different maxima are merged 
in a single bin and cannot be resolved here. From there 
it extends in two branches, which reflect the fact that 
extracting masses from kinematic endpoints involves 
quadratic equations. 
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Fig. 2. SFitter output for the SPSla point in MSUGRA. 
One-dimensional Bayesian marginalised plots for tan/3. 
Upper: High-scale prior flat in B. Lower: Prior flat in tan/3. 



Strictly speaking, the usual set of MSUGRA pa- 
rameter is not purely high-scale, as it contains the 
weak-scale quantity tan (3 which explicitly assumes ra- 
diative electroweak symmetry breaking. This can be 
replaced by the mass parameters B and /j, |18j which 
appear as B[i in front of mixed terms of the type 
HiH®. n can then be eliminated by the requirement 
that the correct low-energy Z-boson mass is repro- 
duced. This distinction is important for the margin- 
alised plots, where a prior as a measure in the pa- 
rameter space must be specified. Fig. [5] shows the dif- 
ferent results of the two choices. In the upper plot a 
purely high-scale model is chosen as has been done in 
the previous plots. The prior is taken to be flat in B. 
This corresponds to a prior in tan (3, which falls of as 
l/tan/3 2 in leading order. This behaviour is clearly 
visible in the plot. It is dominated by noise and the 
prior and as small values of tan (3 as possible are pre- 
ferred. In the lower plot a prior which is flat in tan (3 
has been chosen. Here the plot still shows a significant 
noise, but the maximum is in the correct place. For 
profile-likelihood plots this choice does not making a 
difference in the resulting likelihood. It does however 
have an indirect influence via the Markov chain scan- 
ning. For the PDF a measure in the parameter space 
must be defined which influences the probability for 
suggesting a point. A bad choice of the PDF can then 
lead to a bad coverage of the parameter space. 
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A very important clue for Physics beyond the Stan- 
dard Model is the existence of cold dark matter, which 
in the SM cannot be created in the right quantity. Su- 
persymmetry provides an ideal candidate to solve this 
problem. The lightest neutralino as the lightest super- 
symmetric particle is stable, massive and only weakly 
interacting. That it is indeed responsible for the dark 
matter content of the universe is a hypothesis which 
should not just be simply assumed and used in the fits. 
Using future collider data it is possible to test it. After 
the Lagrangian parameters and their associated errors 
have been extracted, the relic density can then be com- 
puted and compared to the experimental data. In this 
short analysis we ignore the fact that SPSla produces 
a too large density which is excluded and would over- 
close the universe, since we are not interested in the 
central values but in the principal dependence of the 
parameters and the propagation of the errors. 

Using the information on the parameters obtained 
in the previous section we use micrOMEGAs to calcu- 
late an estimate of the relic density i?cDM, SPSia^ 2 = 
0.1906 ± 0.0033. The amount decreases with smaller 
mo and larger tan/3, while the dependence on m\/2 
and Aq is rather weak. This can then be compared to 
the experimental value, which is derived from the mea- 
surement of the fluctuations of the cosmic microwave 
background by WM AP J50| . The current best value is 
fi C DM, oxpfr 2 = 0.1277 ±0.008. Adding possible future 
ILC measurements would mean an additional error re- 
duction in the extracted value by a factor of 10. So the 
accuracy which can be derived from collider data alone 
is well compatible with the experimental precision and 
testing the hypothesis that the lightest neutralino is re- 
sponsible for the dark matter content of the universe 
viable. 



5 Conclusions 

If new physics is discovered at the LHC, the crucial 
task will be to map the possibly strongly correlated 
observables onto a high-dimensional weak-scale La- 
grangian. SFitter with its new weighted Markov chain 
technique has been designed to accomplish such a task. 
It produces both a list of best points and a fully- 
dimensional exclusive likelihood map as output. The 
dimensionality of this map can then be reduced using 
either Frequentist or Bayesian techniques, which yield 
a profile likelihood or marginalised plots, respectively. 
The latter depend on the choice of priors, which can 
have a significant effect when the data cannot con- 
strain parameters sufficiently and demonstrate Bayes- 
ian volume effects. After the reconstruction the param- 
eter set can be used for a prediction of the relic den- 
sity, which can then be compared to the experimental 
value. We find that for SPSla both values have com- 
parable errors. SFitter is despite its name not limited 
to supersymmetry. It can and will be used to study 
further problems involving high-dimensional parame- 
ter spaces. 
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